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Period for Reply 

A SHORTENED STATUTORY PERIOD FOR REPLY IS SET TO EXPIRE 3 MONTH(S) FROM 
THE MAILING DATE OF THIS COMMUNICATION. 

- Extensions of time may be available under the provisions of 37 CFR 1 .136(a). In no event, however, may a reply be timely filed 
after SIX (6) MONTHS from the mailing date of this communication. 

- If the period for reply specified above is less than thirty (30) days, a reply within the statutory minimum of thirty (30) days will be considered timely. 

- If NO period for reply is specified above, the maximum statutory period will apply and will expire SIX (6) MONTHS from the mailing date of this communication. 

- Failure to reply within the set or extended period for reply will, by statute, cause the application to become ABANDONED (35 U.S.C. § 133). 
Any reply received by the Office later than three months after the mailing date of this communication, even if timely filed, may reduce any 
earned patent term adjustment. See 37 CFR 1.704(b). 

Status 

1 )£3 Responsive to communication(s) filed on 5/18/2004 . 
2a)l3 This action is FINAL. 2b)D This action is non-final. 

3) Q Since this application is in condition for allowance except for formal matters, prosecution as to the merits is 

closed in accordance with the practice under Ex parte Quay/e, 1935 CD. 11, 453 O.G. 213. 

Disposition of Claims 

4) |3 Claim(s) 1-25 is/are pending in the application. 

4a) Of the above claim(s) is/are withdrawn from consideration. 

5) D Claim(s) is/are allowed. 

6) M Claim(s) 1-25 is/are rejected. 

7) D Claim(s) is/are objected to. 

8) D Claim(s) are subject to restriction and/or election requirement. 

Application Papers 

9) D The specification is objected to by the Examiner. 

10) E3 The drawing(s) filed on 3/29/2001 is/are: a)S accepted or b)D objected to by the Examiner. 

Applicant may not request that any objection to the drawing(s) be held in abeyance. See 37 CFR 1.85(a). 
Replacement drawing sheet(s) including the correction is required if the drawing(s) is objected to. See 37 CFR 1.121(d). 

1 1) D The oath or declaration is objected to by the Examiner. Note the attached Office Action or form PTO-152. 

Priority under 35 U.S.C. § 119 

12) D Acknowledgment is made of a claim for foreign priority under 35 U.S.C. § 1 19(a)-(d) or (f). 
a)D All b)D Some * c)D None of: 

1 .□ Certified copies of the priority documents have been received. 

2. D Certified copies of the priority documents have been received in Application No. . 

3. D Copies of the certified copies of the priority documents have been received in this National Stage 

application from the International Bureau (PCT Rule 17.2(a)). 
* See the attached detailed Office action for a list of the certified copies not received. 
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DETAILED ACTION 



Response to Amendment 



1 . In response to the office action from 12/1 8/2003, the applicant has submitted an 
amendment, filed 5/18/2004, amending claims 1,13, and 25, while arguing to traverse the art 
rejection based on the amended limitation regarding utilizing linguistic features in building a 
speech recognition model (Amendment, Page 9). The applicant's arguments have been 
considered but are moot in view of the new grounds of rejection, necessitated by the amendment 
to the claims. 



Claim Rejections - 35 USC §103 



2. The following is a quotation of 35 U.S.C. 103(a) which forms the basis for all 
obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set forth in 
section 102 of this title, if the differences between the subject matter sought to be patented and the prior art are 
such that the subject matter as a whole would have been obvious at the time the invention was made to a person 
having ordinary skill in the art to which said subject matter pertains. Patentability shall not be negatived by the 
manner in which the invention was made. 

3. Claims 1- 25 are rejected under 35 U.S.C. 103(a) as being unpatentable over Nouza 

( "Feature Selection Methods for Hidden Markov Model-based Speech Recognition ") in view of 
Eide et al ("A Linguistic Feature Representation of the Speech Waveform, " J 993). 
With respect to Claims 1 and 13, Nouza recites: 
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Obtaining speech input data (HMM and DTW speech recognition systems, Page 188, Col 
1, Lines 5-7; Inherently, speech data would have to be received in order for speech to be 
recognized by the recognition system.); 

Building a model for each feature of an original set of features (parameters used to 
distinguish models of different speech objects in the form of Gaussian mixture pdfs, Page 187, 
Col 1, Lines 5-9, and evaluated for individual feature contributions for speech unit 
classification, Page 188, Col 1, Lines 11-13); 

Ranking the features (feature significance factor that can be used for ordering features, 
Page 188, Col 1, last paragraph - Col 2, first paragraph); and 

Building a model for each of a pre-selected number N of the ranked features (reducing 
the size of feature vectors used in speech processing, Abstract, Lines 5-6, and identifying correct 
and incorrect speech models based upon those features, Page 188, Lines 26-32. It would be 
inherent, upon selection of principal components with the largest amount of variance used for 
separating hypothesis choices, that the principal component features being of a highest 
likelihood to represent a particular state within a HMM be used to remodel a most likely HMM 
candidate for speech recognition.). 

Nouza does not teach the use of linguistic features in building speech models for 
recognition, however, Eide discloses a method for creating speech recognition models using 
speech features that have been linguistically classified (Pages 483-484, Section 1, and Table 1). 

Nouza and Eide are analogous art because they are from a similar field of endeavor in 
speech recognition feature processing. Thus, it would have been obvious to a person of ordinary 
skill in the art, at the time of invention, to combine the use of linguistic features in the creation of 




Application/Control Number: 09/821 ,404 Page 4 

Art Unit: 2655 

a speech model for recognition as taught by Eide with the speech recognition system utilizing 
feature selection as taught by Nouza to improve recognition accuracy through contextual 
information provided by linguistic features, thus implementing a means of keyword spotting 
(Eide, Page 485-486, Section 3 and Table 5). Therefore, it would have been obvious to combine 
Eide with Nouza for the benefit of improving recognition accuracy through the use of linguistic 
features. 

With respect to Claims 2 and 14, Nouza further discloses: 

The method and apparatus according to claims 1 and 13, respectively, wherein said step 
of building a model for each of a pre-selected number N of the ranked features comprises 
building a model for the top N ranked features (reducing the size of feature vectors used in 
speech processing, Abstract, Lines 5-6, feature significance factor that can be used for ordering 
features, Page 188, CoL 1, last paragraph - Col 2, first paragraph, and identifying correct and 
incorrect speech models based upon those features, Page 188, Lines 26-32. It would be 
inherent, upon selection of principal components with the largest amount of variance from an 
ordered component set used for separating hypothesis choices, that the principal component 
features being of a highest likelihood to represent a particular state within a HMM be used to 
remodel a most likely HMM candidate for speech recognition.) 

With respect to Claims 3 and 15, Nouza additionally recites: 

The method and apparatus according to claims 1 and 13, respectively, further comprising 
the step of compiling a confusion matrix for each feature of the original set of features 
subsequent to said step of building a model for each feature of an original set of features 
(covariance matrix used to evaluate the contributions of a feature in speech classification and to 
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determine whether a particular speech model is correct or incorrect, Page 188, Col. 1, Lines 11- 
32). 

With respect to Claims 4, 5, 16, and 17, Nouza further discloses: 
The method and apparatus according to claims 3 and 15 and claims 4 and 16, respectively, 
wherein said step of compiling a confusion matrix comprises computing a score for each feature 
based on the likelihood, as a log-likelihood as per Claim 5, of its presence in a frame of the 
speech input data (contribution of a feature within a covariance matrix in identifying a speech 
unit using a particular speech model, which is represented by a log-likelihood score, Page 755, 
Col 7, Lines 13-23). 

With respect to Claim 6 and 18, Nouza teaches the method and system of feature 
selection in recognizing a speech unit, utilizing a confusion matrix used to evaluate the 
contributions of a feature in speech classification and to determine whether a particular speech 
model is correct or incorrect as applied to Claims 3 and 15. Nouza does not specifically teach 
comparing likelihood scores to a predetermined threshold as a means of detecting whether a 
speech feature is useful in picking a correct classification however, Eide discloses: 

Compiling a confusion matrix further comprises comparing each score of each feature 
with a threshold (detection of a particular linguistic feature within a phoneme that would 
inherently require some type of threshold comparison to determine the presence of such a 
feature, Pages 484-484, Section 1 and Tables 1-4). 

Nouza and Eide are analogous art because they are from a similar field of endeavor in 
speech recognition feature processing. Thus, it would have been obvious to a person of ordinary 
skill in the art, at the time of invention, to combine the use of a threshold comparison in 
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determining the presence of a particular linguistic feature within a phoneme as suggested by Eide 
with the method and system of feature selection in recognizing a speech unit, utilizing a 
confusion matrix used to evaluate the contributions of a feature in speech classification and to 
determine whether a particular speech model is correct or incorrect as taught by Nouza to 
provide a well-known and convenient means of detecting if a linguistic feature is present in 
picking a correct phoneme classification through threshold comparison. Therefore, it would 
have been obvious to combine Eide with Nouza for the benefit of detecting the presence of a 
particular linguistic feature for phoneme classification. 

With respect to Claims 7 and 19, Eide additionally discloses: 

Calculating mutual information between truth and labels for each feature (determination 
of the absence or presence of a particular speech feature designated by a "+ " or "- " in 
phoneme classification, Pages 483-484, Section 1, and Tables 1-4). 

With respect to Claims 8 and 20, Eide further recites: 

Ranking the mutual information calculated in compiling the confusion matrix 
(determination of the most-likely linguistic classes used to describe a phoneme, which would 
inherently require a step of linguistic feature ranking, Pages 483-484, and Tables 1-4). 

With respect to Claims 9, 11, 21, and 23, Eide additionally recites: 

Partitioning the speech input data in parallel, once fore each linguistic feature (dividing 
speech training data according to linguistic feature truth labels, Page 483, Section 1)\ and 

Producing an observation vector (calculating attribute vectors, Page 483, Section 1). 

With respect to Claims 10, 12, 22, and 24, Eide further discloses: 
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Portioning data in parallel from the observation vector, once for each feature (dividing 
attribute vectors into feature-present and feature-absent sets for all linguistic features, Page 484, 
Section 1); and 

Producing final observations (final determination of whether a particular linguistic 
feature is present or absent in speech training data, Page 484, Section 1). 

With respect to Claim 25, Nouza in view of Eide teaches the method of linguistic feature 
selection in recognizing a speech unit as applied to Claim 1. While Nouza in view of Eide does 
not teach it, it would have been obvious to one of ordinary skill in the art, at the time of 
invention, to implement the method taught by Nouza in view of Eide using a computer program 
contained on a computer storage device, since computers are conveniently used and their 
programs easily updated for performing speech recognition operations, while a storage device 
would offer a means of storing any training databases or other necessary stored information. 
Therefore, it would have been obvious to one of ordinary skill in the art, at the time of invention, 
to implement a linguistic feature selection method using a computer program transferable 
between various machines through the use of a storage device, thus increasing method 
adaptability, to obtain the invention as specified in Claim 25. 



Conclusion 



4. The prior art made of record and not relied upon is considered pertinent to applicant's 
disclosure: 
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• Glass (U.S. Patent: 5,822, 729)- discloses a speech recognizer that groups training 
data into classes based upon linguistic features. 

• Tzirkel-Hancock (U.S. Patent: 5,913,188)- teaches a method of classifying 
training vectors according to linguistic features. 

• Leung et al ("Phonetic Classification Using Multi-Layer Perceptrons, " 1990)- 
teaches a phoneme classification method utilizing linguistic speech features 
capable of providing context information. 

• Abrash et al ("Incorporating Linguistic Features in a Hybrid HMM/MLP Speech 
Recognizer, " 1994)- teaches the use of a linguistic feature classifier in a speech 
recognizer. 

5. Applicant's amendment necessitated the new ground(s) of rejection presented in this 
Office action. Accordingly, TfflS ACTION IS MADE FINAL. See MPEP § 706.07(a). 
Applicant is reminded of the extension of time policy as set forth in 37 CFR 1 .136(a). 

A shortened statutory period for reply to this final action is set to expire THREE 
MONTHS from the mailing date of this action. In the event a first reply is filed within TWO 
MONTHS of the mailing date of this final action and the advisory action is not mailed until after 
the end of the THREE-MONTH shortened statutory period, then the shortened statutory period 
will expire on the date the advisory action is mailed, and any extension fee pursuant to 37 
CFR 1 .136(a) will be calculated from the mailing date of the advisory action. In no event, 
however, will the statutory period for reply expire later than SIX MONTHS from the date of this 
final action. 
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6. Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to James S. Wozniak whose telephone number is (703) 305-8669 
and email is James.Wozniak@uspto.gov. The examiner can normally be reached on Mondays- 
Fridays, 8:30-4:30. 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Doris To can be reached at (703) 305-4827. The fax/phone number for the 
Technology Center 2600 where this application is assigned is (703) 872-9306. 

Any inquiry of a general nature or relating to the status of this application or proceeding 
should be directed to the technology center receptionist whose telephone number is (703) 306- 
0377. 



James S. Wozniak 
7/15/2004 




